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Introduction 



The research described in this report is part of a 

three-stage project in the domain of arithmetic story 
problems. The three stages are: 

(1) definition and explication of schema knowledge 

(2) development and evaluation of an instructional 
system designed to teach schema knowledge 

(3) computer simulation of the acquisition and use 
of schema knowledge structures. 



This document focuses on the first stage only. The 
remaining stages will be addressed in future reports. 

One objective of this research is to understand how 
schema knowledge is acquired and used in the chosen domain. 
In particular, the focus is upon ways in which instruction 
influences the development of specific knowledge structures 
in long-term memory. 

It is common to find schema-based research in cognitive 
science. Much of this work, however $ fails to specify 
precisely the nature of a schema. A schema has sometimes 
been considered to be equivalent to other cognitive 
structures such as frame, script, or plan. Mpstof ten, a 
schema is no more than a simple declarative frame with 
variable slots to be filled* The 3ack of specificity about 
the structure of a schema makes it difficult to describe or 
model how an individual learns, stores* and uses knowledge. 
If the schema is the basic building block of cognition, as 
Rumelhart (1980) states, then it must be more clearly 
defined. This report fccuses on the definition and structure 
of schema knowledge; The adequacy of the definition is 
evaluated through computer simulation programs that operate 
within the domain of interest. 



The following aspects of our research are described 

below. First, the domain itself is subjected to analysis of 
the conceptual relations that may be expressed in arithmetic 
story problems. Emphasis is placed upon the underlying 
semantic structure that gives meaning to each, 
problem. Second, each semantic structure is framed as a 
hypothetical memory object { i .e. , a schema), and the 
relationship to accepted theories of memory is developed. 
Third, a computer model is presented that details the 
linkage of schema knowledge to two basis components of long- 
term memory: semantic networks of declarative memory and 
production systems of procedural memory. 
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Semantic Relations of Arithmetic Story Problems 

A story problem may be loosely characterized as an 
abbreviated verbal account of a situation* providing some 
specific information (usually numerical) and requiring use 
of that information to answer a stated question. The 
information given in the problem is embedded within a 
"story" -, but the story rarely contains much detail. The 
reader is expected to recognize the situation depicted in 
the story and to embellish it from his or her own store of 
experiences. Thus, it becomes important that the reader h^ve 
sufficient knowledge stored in memory to be used as 
necessary in understanding the events of the story. 

What is required to understand a story problem? First , 
the reader must recognize the words used in the problem. 
Second, the reader must understand the situation and the 
relationships that exist among objects described in the 
problem. We do not focus hAre on the problems of reading; it 
is assumed that students have the requisite knowledge of the 
situations portrayed in the problems. Our attention is upon 
what knowledge is required for students to understand the 
abbreviated description of the situation and its 
accompanying relations. 

One aspect of understanding appears to be the 
categorizing of similar itesms. Recognition then becomes the 
process identifying the appropriate category. It is obvious 
that one could organize the domain of story problems on a 
number of different dimensions. For example, problems 
requiring the same operations } could be grouped together • 
Alternatively, problems describing the same situation ( s ) 
could be aligned. Or, as we suggest here, problems 
reflecting similar semantic structure could be aggregated • 



Categorization by operation works well only when a 

single operation is required. This approach is currently 
popular with teachers and textbook, developers, but it has 
limited success as a problem-solving strategy. For multi- 
step problems, it is difficult for students to order and 
keep track of the various operations. 

Grouping by story situation also has its drawbacks. 

There are an infinite number of situations that could be 
used* and memory requirements for keeping track of all of 
them would be enormous. Also, for any situation, several 
problems could be devised, each requiring different methods 
of solution. Hence, the situation alone could not 
sufficiently provide clues about solution strategy. 

We argue here that the most efficient and successful 
means of organization is to look for common underlying 
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elements within the structure of the problems. This approach 
involves features that are usually termed semantic 
relations . It requires the ability to understand the 
situation depicted in the story and to perceive the 
relationships that exist between objects. 

In this section, we describe a set of semantic 
relations that characterize fully the domain of arithmetic 
story problems. Other studies in this domain have either 
limited_their scope to a subset of arithmetic operations 
(e.g., Riley, Greeno, & Heller, 1983; Briars & Larkin, 1984) 
or have foeuseduppn characteristics of the quantities given 
in the problem rather than upon the overall structure of the 
problem (cf. Greeno, Brown, Poss, Shalin, Bee, Lewis, & 
Vitolo, 1986 )i Differences between the current 
conceptualization and those of other researchers, 
particularly Riley et al t and Greeno et al., will be 
discussed later in some detail. 



Five semantic relations appear to_ be sufficient for 
characterizing virtually all story problems of arithmetic. 
These are^ Change * Combine* Compare^ Vary, and Transform. 
Each is described below, together with examples that 
demonstrate the different formulations of problems 
containing the relations. 

The CHANGE Relation 

The first and most elementary of the semantic relations 
is the Change, (CH) relation. In its simplest form, the 
relation is an increment or decrement of a measurable 
resource. The change that occurs is physical and is 
permanent. Once the change occurs, the original state cannot 
be revisited. 

A CH relation can be manifested in many forms. The 
following are examples: 

John had 12 baseball cards. His friend Jim gave (1) 
him 5 more. How many cards does John have now? 

Twenty-five tomato plants were growing in the (2) 
garden. Snails ata some of them. There are 15 
plants left . How many did the snails eat? 

I had some money in my checking account • After (3) 
I deposited a check for $35.00, I had $280.75 
in the account. How much was in the account 
before I made the deposit? 

Each problem begins with an initial state (e.g., 12 baseball 
cards, 25 tomato plants, some money in an account) in which 
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the objects to be manipulated are specified and in which the 
quantity or amount of these objects is defined. A change in 
possession occurs at some later time, indicated in these 
problems by the phrases or words "more" (than he had 
before), "loft" (after eating), and "after" (the check was_ 
deposited). The reader must infer the time constraints from 
the position of the statements and these phrases. Aftet the 
change occurs, there is a recognizable final or ending state 
in which the new quantity is defined. The initial state and 
final stste.are not coexistent: they cannot occur at the 
same. time. Either there are 25 tomato plants or there are 
15. Both statements cannot be true. 

As shown in ex?;nples (1), (2), and (3), three different 
questions can be posed in a Change relation. The most common 
situation is to provide information about the initial state 
and the amount of change, leaving the final state to be 
computed (1). A second alternative is to present both the 
initial and finalstate and_to have the individual calculate 
the amount of change (2). Finally, the problem may contain 
the amount of change and the final state, and the question 
is to determine a value for the initial state (3). 

Most CH problems involve only additive change (e.g. , 
require addition and subtraction operations) . However , 
sophisticated problems exist for which there may be 
multiplicative or exponential growth rather than additive 
change . The following is an example: 

There were 300 bacteria on the petri dish. If (4) 
they doubled in number every 2 hours, how many 
would be on the dish after 6 hours? 

This example emphasizes an important point about the 
semantic relations defined here: They are not defined in 
terms of the arithmetic operations tequired for problem 
solution. We are looking at the nature of the relation 
among objects described within each problem; that relation 
Say itself be linked to several possible operations. 
Identifying the relation is not synonymous with identifying 
the operation for solution; 

The COMBINE Relation 

A COMBINE ICB)^ relation is expressed whenever there 

exists a hierarchical or composite grouping of objects 
within a problem. The CB relation involves the renaming of 
elements with respect to a superordinate category, tfo action 
is taken in a CB relation, and, in contrast to the CH 
relation, no permanent alteration of objects occurs. The 
passage of time is irrelevant. 



Schema Knowledge Structures -4» 



9 



ftn iraportant_characteristic of the CB relation is its 

dependence upon understanding part-whole relationships. The 
part-whole concept occurs in many domains , including physics, 
algebra, and arithmetic (cf. Nesher, Greener, & Riley , 1982; 
Kintsch S Greeno, 1985 J . To understand a Combine relation, 
an individual must comprehend that the whole Cor 
superordinate category) isequal to the sum of the parts 
(subordinate categories!. A necessary constraint is that the 
subordinate categories have semantic ties to the 
superordinate one. The logical^ hierarchical structure must 
have a semantic base. For example, the following item 
illustrates a Combine relation: 

Ann has three apples and two oranges. How many (5) 
pieces of fruit does she have? 

Consider ^hat an individual must already know (or must 
be told elsewhere in the problem) in order to solve (5). 
First, he/she must recognize "apples" - 9 "oranges" and 
"fruit". There must bean understanding of the common 
attributes of these three elements. If they have no shared 
characteristics, the problem becomes senseless. The 
individual must also understand that only those elements 
specified in the problem are relevant. We do not speculate 
about how many lemons or grapes are in Ann's possession, 

There must also be an awareness of the relationships 
among apples, oranges, and fruit, as displayed in Figure 1. 
In this Figure* the two elements "apples" and "oranges" have 
ah identical relationship with the element "fruit": both are 
instances or examples of "fruit". However, "apples" and 
"oranges' nonetheless are distinct semantic elements; an 
apple is not equivalent to an orange. Each element maintains 
a definitive set of characteristics that serves to 
distinguish it from ether elements existing at the same 
level (such as pear* grape, lemon) .Our hypothetical 
inuividual solving ti;e above problem must know that it is 
logically or semantically impossible to join elements -at one 
level of this semantic network unless they have identical 
links to a higher level of categorization. That is, they can 
be combined only if one considers that they are instances of 
a more general level of classification and only if they 
equally share the characteristics of that level . 

In terms of Figure 1, the elements "apples" and 

"oranges" inherit- the characteristics of "fruit" because 
there exist links connecting these elements and because 
"fruit" is at a higher level of the network than the other 
two elements. The meaning of a link depends upon the 
direction in which it is interpreted; Thus, it is true that 
"apples" are an instance of "fruit"; it is not true that 
"fruit" is an instance of "apples". Similarly, "apples" have 
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the properties of "fruit" ? the converse is hot true. In 
general^ links running from superordinate categories to 
subordinate ones are inheritance links , and it is common to 
speak of the subordinate or lower level elements inheriting 
all characteristics of the superordinate one. 

The importance of these semantic distinctions is 

apparent in the following item: 

I have three apples and two oranges. How many (6) 
pieces of candy do I have? 

Prom Figure 1, it is clear that "fruit" and "candy" are 
both instances of the superordinate category "food" , and 
both inherit the same set of characteristics about food 
(e.g.* can be eaten , provides source of energy! . However , 
they do not share subordinate elements. That is, "apples" 
and "oranges" are not examples of "candy" and have no links 
to it.Thiis, problem (6) cannot be solved with the given 
information. 

In general, ebmbine problems do not define the 
superordinate and subordinate categories as such. 
Individuals solving the problems are expected to draw upon 
semantic knowledge stored in long-term memory for 
identification and clarification of the elements specified 
in the problem. If the requisite knowledge is missing from 
the individual 's knowledge base, the individual will be 
unable to solve the problem. 

_As with the CH relation, there are varying forms that 
the CB relation may take in an arithmetic story problem. Two 
different questions may be asked. First, the problem may 
require finding a numerical value associated with the 
superordinate category. In that case, values for the 
relevant subordinate ones must be given in the 
problem. Second, the problem may ask for the value of one 
of the subordinate elements. For this case, the 
superordinate value and the remaining subordinate one must 
be known. An example of the first situation is given by (5). 
An example of the second is given below. 

I have three apples and some oranges. If I (7) 
have five_pieces of fruit, how many oranges 
do I have? 

A distinguishing point about the sematic CB relation is 
that the original quantities associated with the subordinate 
and superordinate categories remain unchanged by the 
combination. So, for example, in (7) above, although there 
may be five pieces of fruit, three of them are 3till apples. 



Schema Knowledge Structures 



n 



-6- 



Figure 1 
A Semantic Network 
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The COMPARE R e4 at i Oil 



A third semantic relation in the domain of arithmetic 
story ^ problems is the Compare (CP) relations In a dP 
relation, two elements of a problem are evaluated in order 
to determine their relative size. The meaning of the 
relation comes from weighing one element against the other. 
It is not the absolute value of either one that is central 
here, but rather the relative position that one has with 
respect to the other; 

A necessary part of the relation is the existence of 
two elements, each associated with a hiimirical. value and 
both having the same semantic features. Generally, the 
semantic features of interest are the units in which the 
elements are measured (e.g., feet, hours, gallons). 
Comparisons can only be made meaningfully between elements 
measured against the same standard. For example, we cannot 
say which is larger, a liter bottle or a twelve inch board. 
However, we can compare a liter bottle with ah eight ounce 
jar, provided that we do so on a common unit of measure. 

Implicit in the CP relation is the concept of one-to- 
one matching of one element in the problem with the 
other. As described by Briars and Larfcin (1984)* each 
element is considered to be a set having a given number of 
members; To compare two sets, one theoretically engages in 
one-to-one matching* removing one member from each set and 
setting them apart as a matched pair; The smaller of the two 
sets is the one which first becomes empty. The amount left 
in the larger set is the difference between the two sets, if 
both sets become empty at the same time, they have an equal 
number of members. 

Much like Combine* the Compare relation is static; no 
action occurs in the problem. The CP relation is a 
description, an alternative way of expressing the relative 
size of two sets of similar objects. The representation of 
time* as a variable is usually irrelevant. CP relations can 
occur at the same time or at different times. For example, 
consider the following items: 

Joe makes §4.50 per hour at his job/ and Ed (8) 
makes $5.30 per hour. How much more does 
Joe make per hour than Ed? 

Mary is a gymnast; Last week, she scored (9) 
7.5 on the balance beam in a gymnastics meet. 
In another competition held yesterday, 
she received a score of 8.5 in the same 

event. At which meet did she have the best 

performance? How much better? 
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In (8) , the comparison takes place at a single point in 
time; both individuals currently make the wages stated in 
the problem • In ( 91 , the comparison is between scores 
obtained at two different times. It should be clear from 
these examples that time is not a distinguishing 
characteristic of CP, 

The VARY Relation 

The Compare relation introduced the relationship of 
relative size. In the Vary (VY) relation, there are several 
other relationships that must be understood by an individual 
in order to solve problems having this semantic structure. 
Host importantly* it is necessary to distinguish between 
three types of _elements: subject-units , object-units* and 
associations. Subj ect-un i£g are the foci of the problem 
(e.g., marbles, apples, children) . A subject-unit has a 
particular obiect-unii: related to it by means of a specified 
association . For example^ the statement that "one apple 
costs 25 cents" has one apple as the subject-unit, cost as 
the association, and 25 cents as the object-unit. The 
association relating subject- and object-units is general 
and applicable to every subject-unit; thus, any instance of 
apple will have a cost represented by cents that is 
associated with it (in the restricted environment of this 
particular problem, of course). 

Asecond relationship requisite to the VY relation is 
the concept of pes— uait . The notion of a constant value 
per unit may be explicitly stated or may be merely implied 
by the wording of the probienu In either case, the 
individual must realize that every instance of the subject- 
unit presented in the problem will have the same value of ah 
object-unit associated with it. Thus* we have "ah apple 
costs_25 cents"_or "thecar travels_30 miles on a gallon of 
gas" as examples, knowledgeable students understand without 
being told directly that a second apple will also cost 25 
cents and that another gallon of gasoline will enable the 
car to travel an additional 30 miles. Within a problem, the 
per-unit value remains constant. 

A fundamental difference between VY and the three : 
relations previously defined is that the problem structure 
of VY involves four quantities; two of these are subject- 
units and two are object-units. As described above, a 
subject-unit ii paired with ah object-unit by means of an 
association. For the four quantities describing a VY 
relation, there are two pairs, each bound together by an 
association. Not only must an individual recognize the pairs 
and the associations that bind them, the individual must 
make a mapping from one pair to the other and test the logic 
of that mapping before solving the problem. 
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a >! 14 

ERIC 



Let the first subject-object pair be denoted by 

[subjectl-unit <association> ob jectl-unit ] (10) 

and the second by 

[subject2-unit <assbciation> ob ject2-unit ] . (11) 

Each of the four units of (10) and (11) has two features: a 

£ype and a value . The type refers to the nature of the 

elements, such as aggies , pencils , etc. The value is the 
number of such elements (e.g., 2 apples). 

To satisfy the conditions of the VY relation, an 
individual must test three. constraints; that the types of 
subject-units are identical in expressions |10) and (11), 
that the types of object-units are identical, and that the 
associations described in (10) and (112 are the same. 
Expressions {10j and (ii) should differ only in the 
numerical values associated with the four units^ Further , 
in a typical story problem, three of the four units will 
have known numerical values. In a VY relation, the objective 
is to determine the fourth (unknown) value. 

This constraint evaluation can best be demonstrated by 
the following example: 

The price of one apple is 25 cents. How much (12) 
will 15 apples cost? 

The expressions of (10) and (II) can be rewritten as: 

[1 apple <cost> 25 cents] (13) 

and 

[15 apples <cost> =?= cents] . (14) 

where =?= denotes an unknown value. To solve this problem^ 
ah individual first must establish that a logical structure 
exists. The three tests described above serve this purpose: 
expressions (13) and (14) both concern apples, both involve 
the cost of apples, and both measure cost in terms of 
cents. The importance of these tests is clearly seen by 
examining the_ following problems, in which one or more of 
the tests fail. 

The cost of one apple is 25 cents. How much (15) 
will 5 bananas cost? 

The. cost of one apple is 25 cents. How much (16) 
will 15 apples weigh? 
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The cost of one apple is 25 cents. How much (17) 
will 5 bananas weigh? 

Problems (15 J and (16) demonstrate the failure of a 

single test; problem (17 J shows two failures. The more tests 
that fail, the more illogical the prdblem_appears to an 
individual. Note that each sentence of_(17) is a reasonable 
statement. The difficulty is that the first cannot be used 
to answer the question posed in the second. 

The TRANSFORM Relation 

The fifth and final relation defined here is the 
Transform (TR) relations The essential understanding 
required for a TR is that it is possible to describe one 
object in several different ways* in particular, if the 
object has a numerical value associated with it and if it 
bears a known relationship to another object also having an 
associated numerical value, one may describe the first 
object in two ways, 'in its original metric or as a function 
of the value of the second object. 

Consider a typical Transform problem: 



Sue is 1/3 as old as her mother. If her mother (13) 
is 30 years old, how old is Sue? 

There are two ways to look at Sue's age in this problem. 
First, she must be some number of years old. This is the 
unknown of the problem. Second, since both her age and her 
mother's age can be expressed as years, wecan look at one 
as a function of the other. In this case, Sue's age is 
related to her mothers by the fraction 1/3. Thus, there are 
two statements about Sue's age and both are simultaneously 
true. J 

The TR relation is similar to VY in that two 

relationships are given in the problem. These may be 
expressed as: 



[subjectl *=* objectl-unit ] (19) 

and 

[subject2 *=* object2-uhit j (20) 

with. subject! and subjects being the main foci of the 
problem* with *=* indicating that the leftmost member of the 
expression can be expressed in terms of the rightmost 
member, and with pbject-uhits having types and values as 
described above. We also expect to have a known relationship 
between the two subjects: 

[subjectl *=* <fl> subj*»ct2]. (21) 
Schema knowledge Structures - -11- 
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This expression states that one subject can be expressed as 
a function_<fl> of the second. Finally, by substitution from 
(19) and (20) we have 



[objectl-unit *=* <fl> object2-unit ] (22) 

that is, the first object-unit can be expressed as a 
function of the second object-unit (or vice versa) . 

For a problem to contain a Transform relation, several 
conditions must be met. First , subjectl and subject2 must be 

recognizably distinct entities (e.g. , two individuals) . 

Second, the two object-units must be expressed in the, same 
metric (e.g., years, feet, days). Third, either the subjects 
are related to each other through a stated mathematical 
function or the object-units are so related. Fourth, the 
problem must provide the value of on 

value of the mathematical function (such as "3 times as 
large 1 * or H 5 more than*), or it must specify values for both 
object-units, leaving the mathematical function to be 
determined. 

Using this notation, problem (18) can be represented by 
the following expressions: 

[Sue*s age *=* =?= years] (23) 

[Mother's age *=* 30 years] (24) 

[Sue's age *=* <l/3> Mother's age] (25) 

[=?= years *=* <l/3> 30 years] (26) 



Expressions (23) and (24) state that both Sue's age and her 
mother's age can be expressed as some number of years. The 
number of years is unknown for the former and is given as 30 
for the latter. Expression (25) indicates the relationship 
between the two ages: Sue's age is one-third of her mother's 
age. Expression (25) is actually the solution to the problem 
in this case, and it is obtained by substituting the values 
of the ages from (23) and (24) into the appropriate slots of 
(25). 



An important underlying concept of a TR relation is 
"unity" or "the whole". Implicit in the definition of 
relationship between two quantities is the notion that one 
can frequently define one quantity to be "the whole" and 
express the second quantity as "a part" or "a multiple" of 
the whoie A This is a more sophisticated use of the part- 
whole concept than was needed for the Combine relation. Any 
quantity can be designated "one" or "unity" and any similar 
quantity can be expressed in relative units. The primary 
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SSI 111**1 ls T fc hat both quantities must be measured by the 
same metric. In the age problem above, age is given in 
years. As written, the relation between the two- ages is that 
£- a*** 13 a fraction of her mother ' s 121. The mot fill ' s 
I? 6 whole" and Sue's age is expressed SI "a plrt" of 

lthout changing the relationship between ages, one 
fn S ^w- e lfc as "her mother is three times II old Is 

"whole" aSS JhS'Sih "?-- 8 i9e " OU ^ d be ^siderel the 
whole and the mother's age would be a multiple of it. 

«rosi2P ™ relation is static. No action takes place in the 
problem, chere are no alterations in quantities. In fact 
™,° PP 5 S - te is . trues exact conservation of guantitifl is 
required in moving from one unit of measure to anothl? 
fo^l? 6 -' TR P r °hlems, time is here and now. The 

JhiS £??? hXp ! that are expressirare true at this moment 
they will not necessarily be true at a later date 7? f 
consider the age problem above) . (*•••# 

one answlf^o^f^f^fi 1 " 10 ° f . Tr ansform problems is that 
on© answer to ttte question posed in the problem is 

IS?" C ^i y stat f d ' Por (18)rthe question is "Sow old is 
Sue?" One acceptable and true answer is given in the fifah 
sentence: "sue is 1/3 as old as her mother?" To force ah 
individual to seek the alternate representation o! lue's 

ylars Sld P II b i^^ h S Ul ^ 1^1 the <3»estion as "How many 3 

, Ue? An lndl vidual solving the problem is 
will be e»ri2SSa f • ^ previous experience that the solution 
JSlIi;-- -I- f se in y ears ' even though the explicit 
question about years is not stated. p c c 

r- 1 o-? here are several ties between TR and other 
If?t-££2?; -ftl?^ b u- h Tr ansform and Combine rely upon the 
pa?t-whole relationship. Second, both Transform and Corneal 
are concerned with the relative size of quantities Thifd 
on a ^f?s m ol n fhIlf y b ° th inVOlVS f ° Ur ^antiti^s^^peFating 

«*-„Hon? e ? relation appears to be the most difficult for 

hi?? ?? ne ° USly conside red in TR. Students may nSt realize 
that it is necessary to satisfy many constraints as thev 
seek to recognize the form of a problem; Wofking with I* 
single constraint may lead to an incorrect representation 
and a consequential incorrect solution, furthermore TR is 
a?e ^iffflSi^^ t0 «*«*ad operational^ 



Schema knowledge Structures 



IS 



-13- 



Multi-Step Problems 

Thus far, we have used only simple examples from 
arithmetic in which a solution may be found by a single 
application of one arithmetic operation. Most individuals 
hive little trouble with such items, once they have mastered 
the algorithms of the operations themselves. Rates o£ 
success undergo a dramatic shift when problems require more 
than a sinale computational step. Even highly qualified 
itSlilitS SI arithmetic experience difficulty with multi-step 
problems (Marshall, 1987). 

ft particular advantage of the semantic relations 
introduced here is that they may be used to organize multi- 
step problems. For any problem, there is one central 
Question that is posed and one general situation that is 
described. Within that situation, there may be other 
unknowns and other situations that must be examined, but the 
central one remains the target of the problem solving. For 
example, we can easily construct a problem in which the 
central situation is that an individual has some money, 
makes some purchases, and has a resulting amount of money 
that is less than the original amount. For example, 

Alice had $50.00 when she went to the grocery (27) 
store. She bought two quarts of milk at $.75 
each* 1 1/2 pounds of cheese at $2.75 per pound, 
and a loaf of bread for $1.39, How much money 
did she have after she made these purchases? 

This situation expresses a Change relation. Embedded 
here is a Vary relation (in the case of purchasing more than 
one item for the same cost per item) and a Combine ^l^ion 
(in the case that the individual purchased several items 
each having a given price). 

Many individuals use an operation-based strategy to 
solve both simple and difficult problems (Marshall, 1982). 
Keeping track of the many operation-- proves to be difficult 
for a large number of them, we speculate that this 
difficulty arises because the individuals do not have a 
means by which they can organize the many steps required in 
the problem. 

It is true that one could read problem (27) and make a 
mental note that one should multiply (2 x $.751, multiply (1 
1/2 x $2.75), add (results of the two multiplications plus 
$1.39), and subtract ($50.00 minus the sum resulting from 
the addition step) . One can hypothesize that an individual 
attempts to store this list of operations _ in short-term^^^ 
memory while working the various computations. Since these 
olerations are not logically bound one to the other, some 
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may become distorted or lost. The result is that one or more 
of the operations, is frequently omitted, yielding an 
incorrect or partial solution. We have empirical evidence 
that omission errors are, in fact, among the most common 
errors oh multi-step problems . 

The use of semantic relations introduces a logical 
structure that serves to organize the information in the 
problem. In (27) , once the .individual recognizes that the 
underlying situation depicts a change relation, he or she 
can then look at the components that make up the relation. 
The initial or starting quantity is already known (i.e.^ 
$50,00). The amount of change is not yet known — this is a 
secondary problem that must be solved in order to complete 
the Change relation. 

To solve .the secondary problem, the individual 
perceives that several items are to be purchased and that 
the total cost of these items is required. This represents a 
Combine relation, and the needed elements are the various 
prices. Once again, all components of the relation are not 
known. In this case, there are several items with per-unit 
prices. Again, this represents a problem within a 
problem. We are now at the third embedding level. The 
structure of the problem can be diagrammed as in Figure 2. 
Using semantic relations in this way imposes a hierarchical 
structure on the problem-solving steps and thus should 
enable individuals to monitor these steps. 
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Figure 2 
A Multi-Step Problem 



Alice had $50.00 when she went to the grocery 
store. Shi bought two quarts of milk at $.75 
each, 1 1/2 pounds of cheese at §2.75 per pound, 
and a loaf of bread for $1.39. How much money 
did she have after she made these purchases? 
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The Adequacy of the Clas sification 



The five semantic relations presented above appear to 
be sufficient for classifying virtually all story problems 
of arithmetic. Marshall (1985 J examined all sixth-grade 
arithmetic textbooks adopted for use in California public 
schools. In that study, ea«h story problem was classified 
according to the relations defined here. All traditional 
story problems could be uniquely classified^ Problems that 
were not classified were those involving memorized formulas 
(e.g., find the circumference of a circle, what is the area 
of the triangle) and were not strictly arithmetic. 

For the present study, we evaluated three additional 
sources of problems* each representing a different level of 
arithmetic. These were; arithmetic texts for eighth grade, 
remedial arithmetic materials for community colleges, and a 
newly created text for training navy personnel. The results 
were similar to those found by Marshall (1985) -. 
Approximately 90% of all items from the three instructional 
sources could be uniquely classified according to the five 
semantic relations. The remaining 10% required application 
of geometric or probability formulas for solution and 
involved little semantic interpretation. As expected, a 
somewhat larger number of items involving geometry and 
probability were found in the present study than were 
observed at the lower grade because these topics are given 
greater weight at the upper grades. Therefore, we have a 
larger proportion of unclassif iable items, labeled "other" 
The frequencies with Which each relation -occurred in one, 
two, and more than two step problems are shown in Table 1. 

The two eighth-grade texts contain a total of 629 story 
problems; .178 (76%) of them illustrate a single relation. 
The Navy training materials contain only 35 story problems. 
Most of the problems are simple one-step items (70%) . 
Finally, the remedial materials for community college use 
had 658 story problems, and 68% of these were single step 
items as well. 

A majority of the items (71%) in these three Sources 
consists of simple single-step story problems that can be 
solved by application of one arithmetic operation (see Table 
la). A large number of them contain either a Vary or a 
Transform relation (596 of 935 items, or 54%). Only 15% bf 
all items required use of two different semantic relations 
(excluding "other"). Also, 18% of the two-step problems were 
merely repetitions of the same relation. Table lb contains 
the frequencies with which various pairings occurred. 
Finally, a very low 7% of the items contained as many as 
three semantic relations (see Table lc). 
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Table 1 



Problem Classification by Textbooks: 
The Frequency with Which Semantic Relations Occur 
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Table 1 continued 



C. Problems with more than two steps 



8th Grade Navy Training College Tota] 
Texts Materials Remediation 



Number of problems 45 
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The only problems that we were unable to categorize 
*^re itesas requiring application of formulas * These are 
represented in Table 1 as "other". In many instances, it 
would be possible to reduce these also to the semantic 
information contained in the formula. However , we suspect 
that most students apply the foroulas without deriving them 
each time as would be necessary if the semantic information 
were to be used. 



We have no reason tn believe that these instructional 

sources are atypical of arithmetic texts in general. The 
findings are consistent with the classification made at the 
sixth grade (Marshall, 1985). The evidence is strong that a 
preponderance of attention is devoted to solving simple 
story problems at every level of arithmetic. 

O ther Research -Aboat-Story-Problesag 

Researchers in the fields of cognitive psychology and 
mathematics education have developed several approaches to 
the classification of story problems. These may be loosely 
grouped into "structure research" in which the underlying 
relationships are of interest , and "operation research" , in 
which the arithmetic operations themselves are the foci. 

Structure Research 

Semantic St^u^tuxey The major thrust of research about 

semantic relations has its origins in the work ofRiley, 
Greeno, and Heller (1983)^ Most recently , this line of 
research has been extended by Carpenter {1987); Three 
semantic relations were defined in these studies: Change, 
Combine, and Compare. These are similar to but not identical 
with the relations defined here. 

Riley et al. opted to study a limited subset of the_ 
domain of arithmetic story problems, specifically problems 
requiring bnly_a single operation of addition or subtraction 
for solution. This dependence upon operation turns out to be 
critical. As we pointed out earlier, some of the semantic 
relations can be present in situations that demand any of 
the four arithmetic operations (i^e.. Transform). By 
excluding the operations of multiplication and division, one 
fails to observe the broad structure of the semantic 
relations . Further , one fails to perceive that semantic 
relations truly are operation-free. 

In the Riley etal. approach, the Chang* and Combine 
relations have the same general structure as presented 
above, but we have defined them with greater specif icity and 
have introduced additional constraints. As Riley et al. 
pointed out, most of the problems in arithmetic tnot express 
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CH and CB relations involve only the operations of addition 
and subtraction, and their structure is well understood. We 
have foundr however, that there are also multiplicative 
Change problems, although these are relatively rare. 
Consequently, we hesitate to use- operational labels or 
constraints. 

Our conception of Compare differs substantively from 
that presented in Riley et al. By their classification, both 
of the following items demonstrate the Compare relation: 

Joe has 8 marbles. Tom has 5 marbles. How (28) 
many marbles does Joe have more than Tom? 

Joe has 3 marbles. Tom has 5 marbles more (29) 
than Joe. How many marbles does Tom have? 

Under our theory, only the first of these two items is a 
Compare yroblenu The second is an instance of Transform. 

We find that the structure of (29) is more similar to 
that expressed below in (30) than to (28). 

Joe has 3 marbles. Tom has 5 times. as many (30) 
marbles as does Joe. How many marbles does 
Tom have? 

Problems (29J and (30) have the same basic structure. In 
each case, we_ know the number of marbles that Joe possesses, 
and we know that Tom has some number of marbles that can be 
expressed in terms of Joe's marbles. In both problems, the 
objective is to use the given relation between the boys' 
marbles to determine the actual number of marbles owned by 
Tom, The particular arithmetic operation required is 
irrelevant to our understanding of the problem. 

In this instance, the apparent similarity between (28) 
and (29) is an artifact of the limited domain. If we work 
only in the domain of addition and subtraction, Transform 
problems appear similar to Compare ones because the words 
"more than" and "less than" appear, just as they do in 
Compare items. However, these words themselves do not define 
the Compare relation: it is possible to find them in every 
semantic relation. 

Compositional Structure. A very different 
conceptualization of: structure has been developed by Greene 
et al. Under this agprpach, a problem is characterized by 
the types of quantities to be found in it (p. 9). Four types 
of quantities may etfist in a problem: extensive, intensive, 
difference, and factor^ Extensive quantities are simply the 
number of units of some object, such as "5 apples". 
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Intensive quantities are per unit values, such as "5 words 
per sentence", ft factor is defined as "a unitless quantity 
that relates two other quantities that have the same units" 
(p. 9). For example, if "Tom has 3/4 as many marbles as 
Joe" , 3/4 is a factor. Finally, ^difference is "ah additive* 
relation between two quantities of the same type" (p. 9). In 
the statement "Tom has 3 more marbles than Joe has", the 
difference is 3-more . 

The importance of these four quantities lies in the 
ways they may be combined in a problem. Two general 
compositions are possible: additive compositions and 
multiplicative compositions. Additive compositions involve 
only extensive and intensive quantities. Multiplicative 
compositions may contain all four types of 
quantities. Greeno et al. specify rules by which 
compositions may be formed # and they determine the 
operations used to solve the problems by the types of 
quantities found in each composition. 



ft central focus of the research Is to make a graphical 
representation that characterizes the operations. Diagrams 
are constructed to represent the compositions and the 
quantities described in the problems. To solve a particular 
story problem, it is necessary to identify the type of 
composition and to map the quantities involved in the 
composition into the appropriate diagram. 

Compositional analysis differs significantly from 
analysis based upon semantic relations. First, compositional 
analysis has its basis in arithmetic operations (as in 
ad44 tive and m ultipl i cative compositions) • Second, the 
categories of analysis have no meaning when 
the quantities. That is, general descriptions of situations 
seem to be irrelevant, in contrast, these general 
descriptions are used to form the categories of Change, 
Combine/ Compare, Vary, and Transform. Types of numbers or 
arithmetic operations are secondary to the general 
description. 

Operation Research 

Again, there are several different approaches that have 
been taken. We describe two that seem to be particularly 
influential and relevant . These are the efforts tpclassify 
problems according to their surf ace features an classify 
them according to particular uses or meanings associated 
with the arithmetic operations. 

Surface Structure. There exists a reasonably large body 
of research devoted to mapping the structural variables that 
occur in story problems and to assigning difficulty 
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parameters to these variables. For example, Loftus and 
Suppes (1972) examined item characteristics such as number 
of words, the number of sentences, the number of operations, 
the type of operation^ or the similarity to the last- 
presented item. Multiple regression techniques were used to 
determine which of these or similar features account for a 
large proportion of variance in student responses. 

While studies such as this one are undeniably 
interesting in revealing which characteristics of a problem 
inf luencethe difficulty level of the problem, they 
nonetheless have little or no direct bearing upon how 
students learn to solve story problems. The difficulty is 
that these analyses are based upon external features of the 
problems having little to do with how individuals understand 
the problems. Semantic relations are internal features which 
an individual may relate to knowledge stored in his or her 
long-term memory. 

Bse^ClasBea^ Oaiakin and Bell 119831 give a persuasive 
argument against employing operations to classify story 
problems. Their thesis is that operations have more than a 
single meaning or use . For example, addition may imply both 
a-Pttfcfeinq together and a shift . The first of these is 
closely akin to the semantic relation of Combine . The second 

corresponds to Change. The value of Usiskih and Bell's 

approach is that they maintain an_ emphasis upon operations 
by changing the focus from algorithms to applications. In so 
doing, their approach and ours become compatible. They 
demonstrate the heed to evaluate the different uses to which 
the various operations can be put, and we demonstrate the 
need to perceivethe whole picture as embodied in the 
semantic relations . 

While compatible, the two approaches are not 
synonymous. Some of the more important differences can be 
seen in Figure 3. All of the uses defined by Usiskin and 
Bell for the operations of addition, subtraction^ 
multiplication and division can be mapped into the five 
semantic relations of Change, Combine , Compare, Vary, and 
Transform. Most of the uses are evidenced by a single 
relation. In some cases, a single use might beexemplif ied 
in more than a single relation (e.g., the ratio use of 
division maps into both Compare and Transform). 

In the use classification, each operation is expanded 
into several uses. In the semantic relations 
classifications, several uses are combined into a smaller 
number of relations. The important distinction between the 
use classification and the semantic relations classification 
is that the latter cuts across arithmetic operations. For 
example, the Change relation can require addition, 
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Figure 3: The occurrence of Usiskih and Bell's use classes of 
arithmetic operations within the five semantic relations. Uses 
falling in the intersection of two relations can occur in either 
one. Operations associated with each use are given in parentheses. 



subtraction/ or multiplication (as explained previously), 
and it can demonstrate the addition use of shift or addition 
from subtraction, the subtraction use of shift or recovering 
addend, or the multiplication use of size change. 

We suspect that Usiskin and Bell's taxonomy will prove 
to be especially useful in the next phase of our research, 
the instructional system. In particular , it seems reasonable 
that their definitions of use will be valuable in 
explicating the procedural portion of a schema that derives 
from a particular semantic relation. Thus , »e view the 
schema as broader than either the relation or the use, and 
it is capable of incorporating both classifications in a 
meaningful way. The following section describes the nature 
of a schema and its importance as a general memory structure 
for semantic relations. 
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Schema Knowledge: Some Theoretical Considerations 

Before describing a theory of how schema knowledge. is 

stored in human memory^ wefirst discuss a general model of 
memory^ Like many other researchers in thefield, we posit 
two types of long-terra memory ( LTM) : declarative and 
procedural, There seem to be clear distinctions between 
these two typed of memory. Declarative memory contains 
factual knowledge and knowledge of specific events and 
experiences; It is usual to_ assume that this body of 
knowledge is stored in MM as one or more semantic networks, 
linked together with various degrees of association (see for 
example Anderson , 1983). Through knowledge stored in 
declarative memory , one can answer questions about who , 
what , where, and when . 

A second type of memory contains skill knowledge rather 
than factual knowledge. This memory is called procedural/ 
because it consists of sets of general procedures or rules 
for performing various skills. tJniike declarative memory/ 
procedural memory is highly generalized and hot constrained 
by specific instances or experiences. For example* one uses 
the same skill of grasping an object with one's fingers in a 
large number of different situations; The same procedures 
are utilized* adapting to each situation as necessary. 
Consequently, one may grasp a pencil* a rock/ an apple, or 
another person 1 s hand without requiring different rules for 
how to perform each task. A common 3et of rules applies . 

One reason for distinguishing between declarative and 

procedural knowledge Is that the storage mechanisms and 
retrieval mechanisms seem to be different; Adding knowledge 
to declarative memory is relatively easy, and there are many 
"memory tricks" available to help individuals learn 
declarative facts. For example, individuals can learn a list 
of unrelated words by encoding thfm into meaningful 
sentences (Bower & Clark, 19691 i Simple repetition often 
results in rote learning of declarative information (Rundus, 
1971) . Encoding such as this, of course, does not insure 
that the newly acquired knowledge is linked with other, 
related knowledge. 

One may also acquire highly salient declarative 

knowledge directly without repetition or guided mnemonics* 
For example, a single experience of an earthquake is often 
sufficient to establish quite a bit of declarative knowledge 
about the phenomenon (e.g., noise, shaking). 

In contrast, procedural knowledge is difficult to 
acquire and apparently takes a great deal of practice. As 
Anderson (1982) points out* many skills take 100 or more 
hours to acquire. Many motor skills Have this 
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character istic, for example, learning to type or learning to 
write. A number of cognitive skills have the same feature, 
such as learning to multiply, learning to solve physics 
problems, or learning to program computers. 



Given the many distinctions between these two types of 
LTM knowledge, it becomes important to ask how they are 
related. Clearly, information of one type calls upon 
information of the other. By what mechanisms are these two 
forms of memories united? We suggest that the union occurs 
to a large extent through the acquisition of a schema . 

A schema is a knowledge structure that contains 
information about how we interact with the environment in a 
recognizable situation. As such, it contains necessary 
information about how to recognize the situation and what 
action( si we might take under the circumstances . Under this 
definition, a schema becomes the organ structure 
that governs our functions in everyday experiences, drawing 
upon components of both declarative and procedural memories. 

. An individual is perceived as an active processor of 
information, using incoming sensory stimuli and previously 
stored knowledge to make sense of the world. A schema is 
invoked whenever an individual must formulate a response to 
his or her environment . The individuals responses in 
different situations are governed by the schematic knowledge 
available to the individual. This necessity for a response 
differentiates a schema from other hypothetical knowledge 
structures such as plans (Sacerdoti, 1977) or frames 
(Minsky, 1975) which require no action. 

It is reasonably common in artificial intelligence, 
cognitive science, and cognitive psychological research to 
define a schema in terms of at least some of the following 
components: (1) a declarative store of factual knowledge 
relevant to the schema, (2) a set of conditions that must 
exist if the schema structure fits the experience, {3} i a 
means of setting goals for satisfying schema constraints, 
and (4) a set of rules that can be implemented once the 
schema structure is accepted. We argue here that all four 
components are necessary. 

For any schema, there will be a body of accompanying 
facts that. describe the generic case of the schema. A much- 
used example is the restaurant schema / for which there are 
details about definition and structure (e.g. , a restaurant 
is a place where one goes to purchase food, one typically 
eats the food at the same location, one sits at a table on 
chdrs or benches, and so on). In the specific instance in 
which the schema is used, more details will be added from 
the current situation. 



Schema Knowledge Structures -26- 




- ft schema will also have a set of conditions that must 
be met for the schema to apply, in any given situation. These 
•are applied to a description of thesituatidn. When the 
conditions are not met, the schema cannot not be used to 
explain the situation. These conditions may involve the 
invocation of other schema structures that are prerequisite 
to the current one. To continue the example of the 
restaurant schema, the preconditions are details such as the 
establishment must serve food , the food must be for sale, 
there must be chairs, tables, waiters, menus, cooks, etc. 

These conditions may be used in either top-down or 

bottom-up processing. Suppose, for example, that you have 
entered a building and are standing in a room. You will take 
in sensory information in order to determine just where you 
are - If you see several tables at which people are seated 
on chairs eating a meal, you will likely call upon the 
restaurant schema to help you interpret ail of the details 
you are processing. If you see many rows of chairs, all 
facing one direction, you probably search for another schema 
for clarification, such as alecture hall or a movie 
theatre. This is bottom-up processing — one takes in 
details and tries to interpret them by means of an existinq 
knowledge structure. * 

Now consider top-down processing. You desire to go to a 
restaurant for a meal. You walk into a building. You now 
look for confirmatory eyidencethat you have found a 
restaurant, so you search for details such as tables and 
chairs, individuals serving food, and so forth. In this 
case, the knowledge structure directs your attention to 
specific aspects of the situation. You may not pay attention 
to the fact that there are paintings on the wall or an 
orchestra in the corner. These are not central confirmatory 
conditions for the restaurant schema . 

A third feature of schema structure is a mechanism 'or 
setting goals in the problem-solving process. In this 
component reside the rulfs under which goals are generated, 
ordered, and satisfied. For example, in the restaurant 
schema, several associated goals involve deciding wh^t type 
of restaurant is preferred, how to reach the restaurant, or 
when to go to the restaurant. The satisfaction of these 
goals cay require additional calls to knowledge found in 
another schema^ For example, if one finds one has no cash, 
one needs another schema which might be called "how to pay 
for things without cash". Now information regarding credit 
cards, checks, or 100' s becomes important as well as the 
circumstances in which they are reasonably used. 

The final aspect of schema structure is a set of rules 

that governs an individual's response to the situation that 
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invoiced the schema. Thus, once you have recognized that you 
are in a restaurant {and hence have invoked the restaurant 
schema), your next action will be directed by conditions 
that a;;e part of the schema (e.g., you wait to be seated, 
you order food from the menu, etc.). 

Schematic knowledge drives cognitive processing. This 
function. gives the schema a different stature than knowledge 
previously defined as procedural or declarative. In essence, 
the schema sits on top of these other LTM structures. In our 
model f we see a schema as an overlay that encompasses 
elements of both procedural and declarative knowledge (see 
Figure 4) . As such, it operates as the controlling mechanism 
in information processing. It determines which procedures 
and which semantic networks are to be accessed. 

How is a schema accessed a It must direct 

our recognition processes in a top-down fashion. There are 
many examples in the research literature of individuals 1 
ability to recognize degraded stimuli, especially when 
primed. This priming presumably activates a schema skeleton 
that then directs searching and pattern matching. Degraded 
stimuli are perceived as being adequate fits only if the 
schema is successfully activated. 

We must also consider bottom-up activation of a schema. 
If a number of different features of a situation are 
observed, they may work together to activate the schema 
skeleton. However, much work in psychology suggests that 
humans usually attempt to recognize situations and to work 
in a top-down fashion (c£. Anderson, 1983) .This is closely 
connected to goal-directed behavior. We have expectations 
about what we expect to experience in our daily lives. Each 
expectation takes the form of a schema-- so, for example we 
expect to meet with students in bur offices, we expect to 
open our mailboxes arid receive mail. We do not wait until 
individuals come into a room with us to determine that we 
are at work and that these are students with questions. 

The point about top-down and bottom-up processing is 
important for understanding learning and instruction. While 
admitting that we function in primarily a top-down fashion, 
many psychologists and educators expect learning to be a 
bottom-up process. That is, a schema is acquired by first 
solidifying the declarative and procedural components in 
LTM. Eventually, these elements become interconnected, and a 
schema skeleton emerges. A major theme of the present 
research is the challenge of that position: we suggest that 
schema development may be a top-down process also and that 
instruction ought to take advantage of this aspect of 
information processing. 
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Figure 4 

The relationship between schema knowledge, 
declarative memory, and procedural memory 




Procedural Declarative 
Memory Memory 
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Related Schema Research 

Two alternative views of schema knowledge have their 
origins in studies of reading arid text processing. One has 
its origins in Bartlett^s (19321 study of comprehension and 
focuses on the nature of stories (Stein, 1982), The second 
arises from a new theory of cognition called parallel 
distributive processing (Rumelhart, McClelland and the PDP 
Research Group, 1986). 



Many cognitive psychologists credit Bar tlett ' s ( 19321 
study of text comprehension as the earliest formulation of a 
schema. Bar tlett presented his subjects with a story (the 
most well-known being "The War of the Ghosts" J and asked 
them to reproduce it. Subjects generally changed. and 
distorted stories according to their own cultural 
experiences and conventions. Bar tlett hypothesized that each 
subject had an abstract story representation or schema that 
he or she used to interpret and understand the stories. 

Stein 11982} developed a schema-theoretic approach 
based upon Bar tlett* s conception of schema* Her research 
emphasizes the elements of stories and the relationships 
(such as causal links) that occur. She also examines which 
elements individuals recall and how the form of a story 
influences recall. The work of Stein and her colleagues 
demonstrates the schema nature of stories and the importance 
of story structure. 



Stein" s research centers on the structure of stories 
rather than the organization of memory that individuals must 
have in order to understand the stories; A primary 
difference between her work and ours is that her focus is 
upon the organization of the story and burs is on the 
organization of the memory processes that are necessary for 
understanding the story .Consequently, she defines a schema 
in terms of story features such as setting, different types 
of episodes, and causal relations. In contrast, we present a 
general definition of a schema that specifies the type of 
knowledge contained in the structure and the ways in which 
that knowledge can be used by information-processing 
mechanisms. This conception of schema applies to general 
experiences as well as to stories. 

A different schema-theoretic approach has been taken by 
Rumelhart and his associates (1980, 198 5). A central 
distinction between Rumelhart* s view and the one presented 
in this report is the difference in the conceptualization of 
long term memory^ We adhere to the declarative/procedural 
model; the PDP group holds a model of connected units. 
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Rumelhart U980) developed the following set of schema 
characteristics: (1) to have variables, (2) to have the 
capability of being embedded in other schema structures, (3) 
to represent multiple levels of knowledge from abstract to 
concrete, (4) to represent knowledge rather than 
definitions, (5) to be an active process* and (6) to be a 
recognition device (p. 40-41}. This Characterization is 
important in that it specifies what a schema does; ic lacks, 
however, definition of the structure of the schema. 

More recently, Rumelhart and his colleagues have 

developed the notion of a schema Within parallel 
distributive processing theory (Rumelhart et al. , 1986; 
McClelland et al., 1986). Their conception differs greatly 
from^the one presented in this report. The major difference 
is that a schema is not considered to be ^stored memory 
structure but rather is a collection of activated "units" 
that become activated simultaneously. As such, a schema need 
not be the same each time it is required: depending upon 
each stimulus, some units will be activated and others will 
not. This approach carries with it major instructional 
implications. As Rumelhart: et al. state, 

There is no point at which it must be decided to 
to create this or that schema, Learning simply 
proceeds by connection strength adjustment. . . . 

(Vol. 2, p. 21} 

pur own research suggests that this is not the case: 
our findings indicate that the teaching of specific schema 
knowledge leads to efficient learning and problem solving 
(Marshall, _ 1987) . We provided a group of elementary school 
Children with instruction designed to create a se-t of schema 
knowledge structures corresponding to the five semantic 
relations described previous. The students learned the 
relations quickly and could differentiate them 
accurately. We hypothesize that schema knowledge provided 
the students wi :h a framework for organizing the domain of 
story problems. In this case, it was necessary to have fixed 
structures that *?ere learned as such by the students. We 
will test this hypothesis more thoroughly in the later 
stages of the present research project. 
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Schema Knowledge of Semantic Relations 



In this and the following section, we define more. 

precisely the form taken by a schema for semantic relations . 
In this section, the relations are mapped into the four 
components necessary for a schema ♦ In the next, we discuss a 
computer simulation of how schema knowledge can be used to 
solve problems. 

The basic schema configurations for the five semantic 
relations are presenttsd in Figures 5-9 (see pages xx-yy). 
Each schema is developed with respect to four components: 
t'ie necessary facts stored in long-term memory, the 
prerequisites that must exist within the problem for the 
schema to fit the situation, the goals that may need to be 
set up, and t r tf rules for using the schema to carry put the 
needed comput it ions. At this point, we make no suggestion 
about the order in which this information may be accessed by 
an individual. The four components of a schema are not 
necessarily sequential or linear in their acquisition or 
retrieval. The ordering from one to four is merely for 
convenience in describing them here. 

Declarative Knowledge 

The first component contains the declarative knowledge 
pertinent to the relation * Of primary importance is the 
representation of a typical problem. We hypothesize the need 
to store this information in two forms^ first, as given in 
the table, there* is a simple story form; second, there is a 
corresponding graphical structure that represents the same 
information. The graphs for each schema are given in Figures 
6-10. They contain visual information about possible states 
of a problem and the number of variables that may oe 
required. 

The verbal form of the typical problem represents a 
general template against which the current problem can be 
examined. It can be used as an analogy: can the current 
problem be rephrased in such a way that it matches the 
general case? Similarly, the graphical tree structure 
represents a second, more specific template. The verbal form 
allows the individual to approach the problem broadly 
without paying particular attention to the numbers and names 
used in the problem. Moving to the tree structure forces the 
individual to examine the problem, in finer detail, mapping 
the elements of the problem to specific slots of the tree 
structure^ In this way, the individual recognizes which 
parts of the structure are known, their relationships to 
each other, and which are yet to be found through arithmetic 
computation • 
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When prerequisites remain to be met* subgbals are 
created, and these subgoals must be achieved before, the 
primary goal can be addressed. _ For example, in a multi-step 
problem based upon a Change relation, itmaybe necessary to 
solve an embedded Vary or Combine problem before reaching 
the Change solution. Subgoals for solving these embedded 
relations are created by the goal-setting component of 
schema knowledge. 



The goal-setting mechanisms are the means by which the 
subgbals are established and monitored. They recognize which 
prerequisites aretpbe satisfied and in which order .They 
lisp control acceptance or rejection of goal solutions. 
Several options are available^ Oh the one hand, it may be 
possible to satisfy the condition using a variety of default 
mechanisms. On the other, it may be necessary to invoke 
another schema or additional aspects of declarative 
knowledge to gain pertinent information. 



Implementation- Rules 

Once the schema has been successfully invoked and the 
prerequisites satisfied, it can be used to solve problems . 
The way in which it is used depends upon knowledge of 
specific actions that can be taken. These actions are stored 
as product ion rules, and they act upon the various 
components of the situation as defined by declarative 
knowledge. Depending upon which cbmp nents of the situation 
are unknown* specific rules are carriu out to determine the 
value of the component. For most story problems, the actions 
are applications of arithmetic operations. 
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Figure 5 



The CHANGE Schema 




X. DECLARATIVE PACTS 



Typical problaa: *You have 
iom Moutit of iQMt hi hg , 
you git son oC it (or 
1 oae aoaa _ of _1 1 ) i and now 
you have a new amount. 

Visual tree structure: 
original amount 
amount of change 
result 

Characteristics: 

physical alteration 

permanence 

involvaa time 

associated oparationa : 

addition 
•ubtraction _ 
multiplication 

aaocciaeed uses: 
shift 

alse change 



IX. PRECOHOXTXONS 



Can problem be rephraced to 
match typical problaa? 

Can-tree structure be 
generated? 

Can 2 slots be filled? 

If 2 slots are empty, can 
1 be conatructad f real 
othar information? 

Is changa irreversible? 

Ara quant i tiaa oxpreaaed in 
tha same unit of measure? 

Can tha tiaa parioda ba 
idantifiad? 



XXX. COAX SETTING 
MECHANISMS 

Craata goal list 

Sat top level goal: 
Solve for a single 

unknown, which will 
ba 1 of 3 main parts 

Deteraine if problaa ia 
aulti step 

Sat fubgoala «i nttdtd to 

fill 2 quantity alota 

Expect subgoais to point to 
either VARY or COMBINE: 
try theae firat if naad 
to aolva aubgoals 



XV. IMPLEMENTATION RULES 



Identify original quantity, 
transferred quantity* and 
resulting, quantity. _ 
(0n« or mora will ba 
unknown. ) 

Associate quantities with 
alota 

Identify operational uaa 
associated with empty 
aiot(a) 

Map from use to particular 
operation 

Carry out operations in 
order given by goal liat 



Schema knowledge Structur 



es 



-3*- 



40 BEST COPY AVAILABLE 



ERLC 



Figure 6 
The COMBINE Schema 




I. DECLARATIVE PACTS 

Typical probles: "You have 
two group* of objects 
Ch«c cm b« jointd into « 
larger class of objects." 

visual eras structure: 
parts 

whole 

characteristics: 

hierarchical order 

no _ action _ taken 

maintain identities 

aoasociated operations: 
addition 

subtraction 

aultiplication 

associated uses: 

putting together 
taking away 



XX. PRECONDITIONS 



Can problem be rephrased to 
match typical problem? 

Caii trie structure be 
generated? 

Can 2 slots be filled? 

If 2 slots ere empty, csn 
1 be constructed from 
other information? 

la part whole relation 
present? 

Do object « aaintain 
identity after the 
combination? 

Can two quantities be 
expressed as subclasses 

of the third? 



III. COAX SETTING 
MECHANISMS 

Create goal list 

Set top level goal: 
Solve for a single 
unknown, which will 
be 1 of 3 main parts 

Determine if problem is 
multi step 

Set subgoals as needed to 
fill 2 quantity slots 



Expect subgoals to point 
to VARX: try it first 
if have subgoals 



IV. IMPLEMENTATION RO 



Identify hierarchical 
relationship: 
Hhich objects can be 
combined to form a 
superordinata catego 

Associate quantities w 
slots 

Identify operational u 
associated with enpt 
slot(s) 

Map from usr/ ?o par tic 
operation 

Carry oat operations ii 
order given by goal 
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Figure 7 
The COMPARE Schema 



value 




X. MCOOttTIV* TACTS 



typical probleai_ ,, 8oaebody 
has aore (or lass) of 
eoae thing then eoaeone 

find out ho*' 
auch eora or lass." 



visual tra» structural 
two pacta 
thair difference 

charactaristiosi acatic 

static 

no action taken 
aaintain identitiea 

associatad Operational 
subtraction 
diviaion 

aaaocia^ad uses: 
comparison 
ratio 



II. PRECONDITIONS 



Cnn prbblasi ba raphraaad to 
Match typical problea? 

Can t raa atructura be 
ganerated? 

Can 2 slots be filled? 

It 2 sloes aid «*tpty. can 
1 ba constructs f roa 
otbar information? 

Ara quantities espreeeed 
in sama unit of -meaeura? 



Do object a maintain 
identity after the 
coaparison? 



III. GOAL SEWING 
MECHANISMS 

Craata goal list 

gat top laws 1 goal: _ 
Solve Cor tha unknown 
which wilibe tha 
else of tha coapaciaoh 
or tha idantity of tha 
largar ot eaaller 

Deteraina if probiea ia 
aulti atap 

Sat subgoala as naadad to 

fill quantity slotn 

Expect aubgoala to point 
to VARY or COMBINE 
try thaaa firat if hava 
aubgoala 



IV. IMPLEMENTATION RULE* 



Tdantify two quantities 
that ara to ba compared 

Associata quantitiaa with 
olota 

Identify operational uaa 
associated with empty 
slot(s) 

Map froa uaa to particular 
operation 

Carry out oparationa in 
order given by goal liat 
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Figure 8 
The VARY Schema 




ype 



X. DECLARATIVE FACTS 



typical problem* "You.bavt 
some number of object! - 
Each has a fixed value. _ 
If the number of objects 
Changes, the total value 
changes also.** 

visual tree structure: 
number of objects 
fixed value 
new number of objects 
total value 

characteristics: 
static 

no action taken 
maintain identities 

associated. operations : 
multiplication 
division 

associated uses: 

acting across 
rate 

factor/divisor 
recovering factor 



II. PRECONDITIONS 



Can problem be rephrased to 
match typical problem? 

Can tree structure be 
generated? 

Can 3 of 4 slots be filled? 

If 2 slots are empty, can 
1 be constructed from 
other information? 

Are objects expressed in 
a common unit of 
measure? 

Are values expressed in 

an unit? 



Is the function relating 
a unit and a value the 
same in ail cases? 

Is concept of "per unit" 
expressed? implied? 



III. GOAL SETTING 
MECHANISMS 

Create goal list 

Set top level goal; 
Solve for a single - 
unknown, which will 
be 1 of 4 main parts 

Determine if problem is 
multi step 

Set subgoals as needed 
to fill 3 quantity slots 

No expectations about 
subgoals. 



IV. IMPLEMENTATION RULES 



Identify objects. . 
Identify per object value 
Ide_nti_fy_ relation between 
objects and values 

Associate quantities with 
Slots 

Identify operational use 
associated with empty 
slot(s) 

Map. from use to particular 
operation 

Carry out operations in 
order given by goal list 
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Figure 9 



The TRANSFORM Schema 




V* 



X. 0CCULAATXVS FACTS 



typical pro&Iesu "Two 
things era sxprsssed in 
the same unit ot Mtiun, 
Keen of tUM can be 
sxprssssd _e* a function 
of the other om.' 



XI. MCCOttOXTXOHS 



XZZ. 



GOAL SCTTXNQ 

juauuxiM* 



visual trta itruotum 
two t hinge 
relational function 

characteristics! 

etetic 

no action taken 
in»*ol vaa relation 

bet we* a 2 objects 
involve, pert/whole 

aaaociat ed coo ra t ion t 
addition 
subtraction 
multiplication 
division 

ssnoclstsd uses* 
alia change 

ratio 

siss chanoo 

divisor 

cosjpsrisos 



^•_P?°bl#* rsphrassd to 
eat eh typical problem? 

Can_tree_ structure on 
generated? 

are that e 2 distinct things 
with possible nuasrical 
vsluss? 



Are the thinge esproooeblo 
in tat mm unit of 
aeaaure? 

Xn ons of ths things eteted 
la tsraa of ths other? 

things bs 
considered to be a 
•whole- so that ths othsr 
one can bs nsasursd 
against it? 



Crsats goal list 

not top level goali. 
folve for a single 
unfcnoMi, which will 
ho I of 3 main porta 

Deteraine if problsa is 
sulci stsp 

subgoals as needed 

to fill 2 quantity slots 

*o eapeetation about 
aocondary a tape. 



H*ely to bs ths first 
stsp in oultl stsp 
pro* Is* 



XV. IMPLEMENTATION BULKS 



identify ths 2 objecte to 
bo related. 

Xdsntify ths function that 
rnlatss ths 2 objects 

Xdsntify ths object to 
bo expreeeed in tsraa 
of the othsr 

Xdsntify operational uss 

Map fro* operational uaa to 
operation 

Carry out operation* in 
order given by goal li»t 
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Computer Modelling of Problem Solving 

In this section, we describe computer yodels of the 
semantic relations of Change, Combine, Compare, Vary, and 
Transform. Each is implemented according to the definitions 
and constraints developed above. The sufficiency of our 

specifications is evaluated in terms of the computer 

simulation's success in determining the correct relation for 
sdiving a set of simple story problems. 

The section has the following outline. First* the form 
in which story problems are presented to the system is 
described. Second, the general characteristics of the 
computer models are given > tbget! er with examples of the 
different semantic relations. Finally, we discuss several 
issues that remain unresolved. 

Propositional Encoding 

The creation of a computer system that can parsestqry 
problems stated in natural language is beyondthe scope of 
this project, Consequently, we (like others) rely upon 
propositional encoding of thestory problems, and our 
computer programs operate upon these propositions. The bases 
for encoding problems into propositions are described here 
together with examples of several story problems. 

Our objective is to . represent the seriantic relations 
contained in a story problem and not the individual meanings 
of words. This is an important point, because it means that 
the computer model can operate with a reasonable but 
restricted base of wcrld knowledge. For our purposes, many 
semantic labels will be indistinguishable. For example, the 
semantic differences between pieces of furniture such as 
chair, table* or desk are unimportant. Our system would note 
only that these are allinstances of furniture and are 
different from each other. The ways in which they are 
distinct are not (usually) significant factors in solving a 
story problem. Similarly, the system would know that apples, 
oranges, lemons, and bananas are all types of fruit. Unless 
additional characteristics are required by the problem 
statement le.g., unlessa question such as "how many pieces 
of yellow fruit are there in the basket"?), they are not 
represented in the knowledge base. 

In like mahrier> several actions that occur within a 
story setting have a common meaning. For example, there are 
many ways to express the notion that an individual possesses 
something: has, owns, keeps, holds, and grasps are only a 
few. In most instances, the differences between these terms 
are not critical to the semantic relation expressed in the 
problem; they could be interchanged without loss of 
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understanding. Therefore, it is possible to rely upon a few 
primitive verbs to express the acts described in story 
problems. Again , this simplifies the knowledge base used bv 
the computer models." * 

Each story problem to be solved by the computer 
simulation is encoded as a set of propositions. These 
propositions contain all relational and numerical 
information present in the problem (whether relevant or 
not)^ They do not necessarily contain all extraneous details 
of the situation. For example, consider t.ie problem: 

On her way to work, Mary found $5.00 on the (31) 
ground. She picked it up and put it in her 
coat pocket. She already had $16.00. How 
much money does she have now? 

The relevant propositions for this problem need to capture 
the information that Mary had some money with her and that 
she acquired sane additional money. Several del-ails of the 



problem are unimportant with respect to the underlying 
semantic relation, for example, it is not necessary t6 know 
that^Mary was on her way to work — her destination has 
little to do with the structure of the problem. Similarly, 
it is unimportant in this problem to know in which pocket 
she put the money. v 

The General Form-bf Propositions 

Each proposition is an expression composed of five 

elements i 

[subject primitive object direction time] 

The subject is the central character or actor of the 
proposition. It may have a type attribute, a name attribute, 
and/or an associated numerical value. For example, the 
subject of a proposition might be "Three boys". In this 
case, boyis the type, and the numerical value is 3. Name is 
unassigned because the boys' names are not given. Typically, 
only the type and/or name is given. * y ' 

The primitive defines the action of the proposition. As 
described above, this is generally a class of actions such 
as possess or transfer. 

The object functions in a proposition as a direct 
object. Like the subject, it may have a type attribute, a 
name attribute, and/or an associated numerical value. 
Usually the type attribute and numerical value suffice to 
describe the object (e.g., 15 cookies, 4 lemons). 
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The primitive may have directionality. For example* 
objects may be transferred to, or f torn the subject. The 
element direction contains this information in two 

parts. Firft, the actual direction (to or fromj is__ 

specif ;ied. Second/ the recipient of the action and direction 
is given. The recipient is similar to a indirect object in 
traditional grammar, but it also allows representation of 
passive statements. Thus, we can represent "Mary gave John 
10 apples" by 

[Mary transfer (10 apples) (to: John) ...] 

with Mary the subject, transfer the? primitive, {10 apples) 
the object , and (to: John) the direction. In this case, the 
recipient John is the indirect object of the transfer . Had 
the problem been stated as "John was given 10 apples by 
Mary", we have a different proposition: 

[John transfer (10 apples) (from: Mary) ...] 

in which the recipient contains information about the origin 
of the transfer. 

It is obvious that one could always force passive 
statements into a prepositional form cf an active kind (such 
that the subject always performs the action). Such a policy 
distorts the structure of the problem. By allowing both 
passive and active statements through the directionality 
specification/ we preserve as closely as possible the way in 
which information is presented in a problem. 

The fifth element of a proposition is time. This also 
has two parts, one which denotes in a general way whether 
the action occurs in the present, past or future, and the 
second which is a label (such as the day of the week). Verb 
tenses are encoded in the first part? primitives are always 
expressed as infinitives. 

Ty, i of Propositions 

There are three types of propositions : state, event , 
and query. State propositions reflect a constant state of 
the world. Two kinds of state propositions may be nade; The 
first describes attributes belonging to the subject . The 
primitive in this case typically is "possess" and the 
proposition indicates that a subject has some object. The 
second kind of state proposition is used to specify group 
membership and the primitive is "is", such as "George is a 
boy". 

An event proposition denotes a change in possession or 
some other action in which objects or subjects gain or lose 
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numerical value. The elements of time and direction are 
important for event propositions because they define when 
the event occurred and to whom. 

Finally, a query proposition reflects that some element 
or part of an element is unknown and asks for information 
about that unknown. In most cases, the unknown is a 

numerical value associated with an object or subject. 

Occasionally, the unknown is a particular object or subject 
(as in the situation where we want only to discover which of 
two individuals has the most or least of something). 

Computer Implementation 

The computer models were developed in PRISM, a computer 
system implemented in InterLispD for use on Xerox D- 
machines. Created by Pat Langley, PRISM is a system that 
facilitates construction of production systems within the 
InterLisp environmint (Langley & Neches, 1981; Ohlsson & 
Langley, 19861. PRISM is especially well-suited for the 
current project because it distinguishes between working 
memory and long-term memory and allows generation of 
alternative architectures for production systems. Thus, it 
can be altered as necessary for any particular programming 
task, and We have made modifications to allow representation 
of the declarative and procedural knowledge used in our 
simulations. 

We define three parts of the system: working memory, 
procedural memory, and declarative memory. Working memory 
contains all the information that is active in the -system at 
any given moment. Procedural memory consists of a set of 
production rules that either identify_ the underlying 
relation or take appropriate action once the relation has 
been specified. Declarative memory is maintained as a 
semantic network. At this point, the connections or links 
between elements in the network are primarily thosi* 
describing inheritance, such as "apple is a fruit" or "boy 
is a child". We anticipate that other links will be created 
as we develop a more complete system. 

To solve a problem, the system operates only upon 
working memory (WM) , which is initially empty. At various 
times, it iray contain elements from incoming stimuli (e.g., 
pieces of the problem) or elements from long-term memory 
that have been activated by production rules. 

When a problem is presented* each proposition of the 
problem enters working memory. As the system encounters the 
propositions in WM, it checks to see whether certain 
relationships are present. In order to solve a problem, the 
system must recognize the semantic relation that underlies 
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it,. Recognition is achieved by the production rules which 
operate upon the propositions, and it may entail a search in 
and activation of portions of declarative knowledge. For 
example, a proposition may contain reference to girls, boys, 
and chiidrenj. To determine the relationships that exist 
among these three categories, the system searches 
declarative memory and discovers that c^ildrenis a class of 
objects that can be decomposed into two subclasses, boys and 
girls. This information is added to working memory, and it 
leads to the satisfaction of a primary constraint of the 
Combine relation (see Table 3). 



The recognition set of production rules correspond 
roughly to the preconditions established for each schematic 
representation of a semantic relation. These rules are based 
upon constraint matching rather than upon key words found in 
the propositions. When the constraints are satisfied, the 
system "recognizes" the embed relation in the problem and 
puts that information in working memory. Thus, in the_ 
example above, a statement identifying the Combine relation 
is added to working memory. 

Once a_ relation has been identified and labeled in 

working memory, the system, activates schema-based rules and 
attempts to carry out the necessarycomputations. To do 
s 9* it interprets information presented in the propositions 
with respect to the general framework of the schema that has 
been invoked. Thus, these rules contain information about 
the number of quantities that must be already known and how 
to find values for those that are unknown. 

Table 7 contains an example of how the system solves 
one problem. At the top of the Table, the problem statement 
is given, followed by a set of propbsitional encodings. The 
first three propositions are presented to the system as the 
original encodings; the last two are created by the system 
as it solves the problem. The_ lower portion of the Table 
illustrates the pirobiem-solving steps that are required for 
solution by the system. The steps are represented in Table 7 
as cycles . Under the current constraints of PRISM, each 
cycle culminates in the firing of one production rule. 

In the first cycle, working memory (WMJ contains three 
propositions: Pi, P2, and P3. Several conditions pertain to 
this situation described by these propositions. First, a 
transfer of some given amount has takea place and the result 
of that transfer isunknown. The transfer occurs for a 
subject (in this case, Sally) and the unknown result also 
belongs to Sally- The object being transferred is money, 
expressed in dollars. Finally, no semantic relation has yet 
been identified, and no schema rules have been called. Only 
one production rule maps into these conditions. The rule 
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takes the action of identifying the relation as a CHANGE and 
places that identification into WM as scheraa=CH. 



In Cycle 2, WM contains the original propositions plus 
the schema identification. The conditions present in Cycle 1 
are also present in Cycle 2, and the constraints to be 
matched are t:ie same With the exception of the known 
relation. Again, only one production rule satisfies the 
constraints. The resulting action is to execute the transfer 
of P2? that is, a new proposition P4 is created in which the 
subject. now possesses the objects. P4 is added to WM. The 
original proposition indicating transfer is now obsolete and 
is deleted from WM. 



Following the addition of P4 to WM, the system now 

recognizes that one subject possesses two known quantities. 
However , these possessions are recorded at different times. 
The next cycle checks that one of the possessions occurs 
earlier than the other and that it can be logically inferred 
that this possession is unaltered when the second possession 
takes place. If this is the case (as demonstrated in Cycle 
3 of Table 7), the time of the_first possesion is upu^ted. 
We represent this updating in proposition P5. The original 
information is now incorrect and is removed from WM f leaving 
three propositions: P3, F4, and P5. 

InCycle 4, the system solves for the unknown quantity. 
The conditions to be methere are that an identified subject 
possesses two different amounts of an object expressed in a 
standard unit and that the total of these amounts is 
unknown. The total is computed and inserted into Proposition 
3, replacing the unknown ??? with the computed value. 
Propositions P4 and P5 are removed immediately from WM and 
are irretrievable once the aggregation takes place. 

Under PRISM architecture, the system continues to run 
until ho production rule can be executed. Thus, the final 
cycle of any problem-solving endeavor looks like Cycle 5 of 
Table 7. No productions are acceptable, no constraints are 
evaluated, and execution terminates. 

The existing rules and semantic network are sufficient 
for solving a set of twenty simple problems we have used in 
this and in other research about semantic relations 
(Marshall, 1987) . These problems are presented in Table 8. 
All five semantic relations are present in this set, with 
four exarnpies of each. The system reaches correct 
identification of each relation and uses schema knowledge to 
solve the problems. 
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Table 7 

An Example of How the System Solves a Change Problem 



PROBLEM STATEMENT: 



Sally had §2.53 in her pocket when she went out for 

a walk. She found $.35 oft the sidewalk and put it 

in her pocket. How much money did she have at the 
end of her walk? 



PROPOSITIONS ENCODING: 

PI state proposition:. 

subject: Sally 
primitive: possess 

object: money (quantity .1.53) (unit dollar) 

direction: nil 
time: 0 

P2 event proposition:. 

subject: Sally_ 
primitive: transfer 

object: money (quantity .35) (unit dollar) 
direction: to Sally 
time: 1 

P3 state proposition: _ 

subject: Sally 
primitive: possess 

object: money (quantity ???) (unit dollar) 

direction: nil 
time: 2 



P4 state proposition: 

subject: Sally 
pr ixhit Ive : possess 

object: money (quantity .35) (unit dollar) 

direction: nil 
time: 1 



P5 state proposition: 

subject: Sally 
primitive : possess 

object: money (quantity 2.53) (unit dollar) 

direction: nil 
timet 1 
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Table 7 continued 



PROBLEM SOLUTION: 



CYCLE 1: 
Elements of WM: 



Constraints satisfied: 



Action taken: 



PI 
P2 
P3 



transfer occurs (P2 in WM^ 

result of transfer unknown (P3 in WM) 

subject , objects r and units of 

objects are constant 
transfer time earlier than result 
schema unknown 

add schema=CH to WM 



Cycle 2: 
Elements of WM: 



Constraints satisfied: 



Action taken: 



scheraa=CH 

PI 

P2 

P3 

schema 3 CH 

transfer occurs (P2 in WM) 

result of transfer unknown (P3 in WM) 

transfer time earlier than result 

carry but transfer 
create P4 
add P4 to WM 
delete P2 from WM 



CYCLE 3: 
Elements of WM: 



schema=CH 

pi 

P3 
P4 



Constraints satisfied: 



Action taken: 



schema=CH 

possessions occur at different times 

update originial possession time 
create P5 
add P5 to WM 
delete PI from WM 
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Table 7 continued 



CYCLE 4: 

Elements of WM: scheuia=CH 

P3 
P4 
PS 

Constraints satisfied: two possessions occur 

the times of possession are the same 
the total possessed is unknown 
schema=CH 

Action taken: solve for ??? in Proposition (3) 

modify P3 in WM 

delete P4 from WM 
delete P5 from WM 

CYCLE 5: 

Elements of WM; schema-CE 

P3 [modified] 

Constraints satisfied: none 

Action taken: no production acceptable 
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Table S 

Examples of Semantic Relations in Arithmetic Story Problems 
1. CHANGE RELATION: 



For Hallpween # Sue gut 35 candy bars into a large bowl 
for children who came "trick-or-tr eating" at her house. 
She decided that this might not be enough, so she put 
another 12 candy bars into the bowl. How many candy bars 
were in the bowl? 

Sally had $2.53 in her pocket when she went out for a 
walk. She found $.35 on the sidewalk and put it in her 
pocket* How much money did she have at the end of her 
walk? 

Peter bought 45 cookies for the school party. On the way 
to school, he got hungry and ate 7 of the cookies. He 
took the rest of the cookies to school for the party, 
low many cookie3 did he contribute to the party? 

Before the volleyball game, there were 40 towels for the 
players to use. After the game, the coach could find 
only 28 towels. How many towels disappeared during the 
game? 



2. COMBINE RELATION: 

At Evans Elementary School/ there are 15 members of the 
boys' basketball team and 18 members of the girls* team. 
How many students are on basketball teams? 

Jodi makes $12.50 a week on her paper route and $3.45 a 
week for doing chores at home. How much money does Jodi 
earn each week from these two activities? 

At the track meet, there are 83 competitors; 31 of them 
are boys. How many are girls? 

Jerry made fruit salad with apples and bananas. He made 
6 1/2 cups of salad i If he put 3 3/4 cups of bananas in 
the fruit salad, how many cups of apples did he use? 
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3. COMPARE RELATION: 

When Chef Jack cooks a roast beef, he bakes it in the 
oven for 3 hours. When he prepares roast chicken, he 
only cooks it for 1 1/4 hours. How much longer does a 
roast beef cook than a roast chicken? 

The best gymnast at Central High is Mary. In the last 
gymnastics meet, she scored 8.1 on the balance beam 
exercise and 9.4 on the vault > How much better did she 
do on the vault than on the balance beam? 



Mount Ranier, in the state of Washington, is 14,408 feet 
high. Mount Washington, in New Hampshire, is 6288 feet 
high. How much higher is Mount Ranier than Mount 
Washington? 



Jeff earns $5.50 per hour at his job, but George only 
makes $4.25 per hour at his job. How much less per hour 
does George make? 



4. VARY RELATION: 

Kevin plays on the school baseball team. Every time any 

player on his team hits a home run, the coach gives the 
piayer 3 baseball cards. Kevin hit 7 home runs this 
year. How many baseball cards did the coach give Kevin? 

One pound of potatoes cost $.35. What would five pounds 
cost? 

Mark's grandfather is 85 years old today. Mark's mother 
knows that she can't put 85 candles on his birthday cake 
(because they won't fit). She decides to use 1 candle 
for every 5 years of Grandfather's age. How many candles 
should she put on the cake? 

Sheila likes to make 3 pitchers of lemonade at once. To 
do this , she uses 24 cups of water. How much water would 
she need to make only 1 pitcher of lemonade? 
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Table 8 continued 



5. TRANSFORM RELATION: 



In a football-kicking contest, Joe kicked the ball 13 
yards farther than Sam kicked it. Joe kicked the ball 30 
yards. How far did Sam kick the ball? 

Albert spent $3.75 at the school fair. Mike spent 4 
times as much as Albert at the fair. How much did Mike 



Alice's mother is three times as old as Alice. If her 
mother is 45 years old, how old is Alice? 

Cindy has $4.67. Her friend Bill has $.35 more than 
Cindy. How much money does Bill have? 
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Unresolved Issues 



We have riot yet implemented the goal mechanisms 
required for full schema representation. The sets of rules 
described above operate successfully oh a small number of 
multi-step problems we hava presented to the system thus 
far, but they do so by determining which schema can be 
invoked with current information rather than by identifying 
the top-level schema of a problem; For example, in a multi- 
step problem such as the one diagrammed in Figure 8, the 
current system would not recognize that the problem was 
essentially a Change relation with other relations embedded 
in it. Rather, it wouldfirst solvea vary problem then look 
to see if other problems needed to be solved. It would find 
that it could solve the Combine problem and would do so, 
still without. identifying the need to solve the Change 
problem. Finally, after solving both the Vary arid Combine 
subproblems, the system would address the Change relation 
and would carry out the remaining pperationand reach a 
final solution i Thus, the system can solve multi-step 
problems but it does so without using goals. If information 
were presented in the problem that could be used to 
formulate a relation that was actually unnecessary for 
ultimate solution, the system would be misled and would 
engage in solving an irrelevant problem. 

Most of our attention to date has been on the 
representations necessary to identify and invoke a single 
schema. We are now working on the goal-setting mechanisms. 
We are also engaged in loosening the constraints that 
identify different relations. This will allow weak 
identification of a relation and its associated schema and 
will permit us to begin to model ways in which each schema 
may be inappropriately instantiated. 

Summary 



The computer implementation provides support for the 

schema structures developed here. Using the specifications 
of the relations and the components of schema knowledge, the 
computer programs successfully identify and solve a variety 
of story problems. The next phase of our research will be to 
develop an instructional environment in which the elements 
of relational and schematic knowledge can be clearly 
demonstrated, manipulated, and isolated. 
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